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Abstract 

Consider a two-treatment, two-period crossover trial, with responses that are 
continuous random variables. We find a large-sample frequentist 1 — a confidence 
interval for the treatment difference that utilizes the uncertain prior information 
that there is no differential carryover effect. 
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1. Introduction 

We consider a two-treatment two-period crossover trial, with responses that are 
continuous random variables. This design is very popular in a wide range of medical 
and other applications, see e.g. Jones and Kenward (1989) and Senn (2006). The 
purpose of this trial is to carry out inference about the difference 9 in the effects of 
two treatments, labelled A and B. Subjects are randomly allocated to either group 
1 or group 2. Subjects in group 1 receive treatment A in the first period and then 
receive treatment B in the second period. Subjects in group 2 receive treatment B 
in the first period and then receive treatment A in the second period. This design 
is efficient under the assumption that there is no differential carryover effect. It is 
not an appropriate design unless there is strong prior information that this assump- 
tion holds. However, a commonly occurring scenario is that it is not certain that 
this assumption holds. We consider this scenario. To deal with this uncertainty, 
it has been suggested (starting with Grizzle, 1965, 1974 and endorsed by Hills and 
Armitage, 1979) that a preliminary test of the null hypothesis that this assumption 
holds be carried out before proceeding with further inference. If this test leads to 
acceptance of this null hypothesis then further inference proceeds on the basis that 
it was known a priori that there is no differential carryover effect, ff, on the other 
hand, this null hypothesis is rejected then further inference is based solely on data 
from the first period, since this is unaffected by any carryover effect. In a landmark 
paper. Freeman (1989) showed that the use of such a preliminary hypothesis test 
prior to the construction of a confidence interval with nominal coverage 1 — a leads 
to a confidence interval with minimum coverage probability far below 1 — a. For 
simplicity, Freeman supposes that the subject variance and the error variance are 
known. In other words. Freeman presents a large-sample analysis. Freeman's con- 
clusion that the use of a preliminary test in this way 'is too potentially misleading 
to be of practical use' is now widely accepted (Senn, 2006). Freeman's finding is 
consistent with the known deleterious effect of preliminary hypothesis tests on the 
coverage properties of subsequently-constructed confidence intervals in the context 
of a linear regression with independent and identically distributed zero-mean nor- 
mal errors (see e.g. Kabaila, 2005; Kabaila and Leeb, 2006; Giri and Kabaila, 2008; 
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Kabaila and Giri, 2008). 

A Bayesian analysis that incorporates prior information about the differential 
carryover effect is provided by Grieve (1985, 1986). However, there is currently no 
valid frequentist confidence interval for the difference 9 of the two treatment effects 
that utilizes the uncertain prior information that there is no differential carryover ef- 
fect. Similarly to Hodges and Lehmann (1952), Bickel (1983, 1984), Kabaila (1998), 
Kabaila and Giri (2007ab), Farchione and Kabaila (2008) and Kabaila and Tuck 
(2008), our aim is to utilize the uncertain prior information in the frequentist infer- 
ence of interest, whilst providing a safeguard in case this prior information happens 
to be incorrect. We follow Freeman (1989) and assume that the between-subject 
variance and the error variance are known. As already noted, this corresponds to 
a large-sample analysis. The usual 1 — a confidence interval for 9 based solely on 
data from the first period is unaffected by any differential carryover effect. We use 
this interval as the standard against which other 1 — a confidence intervals will be 
assessed. We therefore call this confidence interval the 'standard 1 — a confidence 
interval'. We assess a 1 — o; confidence interval for 9 using the ratio 

expected length of this confidence interval 
length of the standard 1 — a confidence interval. 

We call this ratio the scaled expected length of this confidence interval. We find 
a new 1 — a confidence interval that utilizes the uncertain prior information that 
the differential carryover effect is zero, in the following sense. This new interval has 
scaled expected length that (a) is substantially less than 1 when the prior information 
that there is no differential carryover effect holds and (b) has a maximum value that 
is not too large. Also, this confidence interval coincides with the standard 1 — a 
confidence interval when the data strongly contradicts the prior information that 
there is no differential carryover effect. Additionally, this confidence interval has the 
attractive feature that it has endpoints that are continuous functions of the data. 

The properties of the new large-sample confidence interval, described in Section 
2, are illustrated in Section 3 by a detailed analysis of the case that the between- 
subject variance and the error variance are equal and 1 — a — 0.95. In Section 2, 
we define the parameter 7 to be the differential carryover divided by the standard 
deviation of the least squares estimator of the differential carryover. As proved in 
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Section 2, the coverage probability of the new confidence interval for 6 is an even 
function of 7. The top panel of Figure 2 is a plot of the coverage probability of 
the new 0.95 confidence interval for ^ as a function of 7. This plot shows that 
the new 0.95 confidence interval for 9 has coverage probability 0.95 throughout the 
parameter space. As proved in Section 2, the scaled expected length of the new 
confidence interval for 9 is an even function of 7. The bottom panel of Figure 2 is a 
plot of the square of the scaled expected length of the new 0.95 confidence interval 
for ^ as a function of 7. When the prior information is correct (i.e. 7 = 0), wc 
gain since the square of the scaled expected length is substantially smaller than 1. 
The maximum value of the square of the scaled expected length is not too large. 
The new 0.95 confidence interval for 9 coincides with the standard 1 — a confidence 
interval when the data strongly contradicts the prior information. This is refiected 
in Figure 2 by the fact that the square of the scaled expected length approaches 1 
as 7 — > cxD. 

In Section 4, we compare the two-period crossover trial with a completely ran- 
domized design with the same number of measurements of response, using a large 
sample analysis. We assume that the new 0.95 confidence interval is used to sum- 
marise the data from the two-period crossover trial. We show that the uncertainty 
in the prior information that there is no differential carryover effect has the follow- 
ing consequence. Subject to a reasonable upper bound on how badly the new 0.95 
confidence interval can perform relative to the usual 0.95 confidence interval for 9 
based on data from the completely randomized design, the completely randomized 
design is better than the two-period crossover trial for all (subject variance) / (error 
variance) G (0, 11.6263]. In Section 5 we describe the implications for finite samples 
of the results described in Sections 3 and 4. 

2. New Icirge-sample confidence interval utilizing prior information about 
the diff'erential carryover effect 

We assume the model for the two-treatment two-period crossover trial put for- 
ward by Grizzle (1965), as described by Grieve (1987). Let rii and 712 denote the 
number of subjects in groups 1 and 2 respectively. Also let Yijk denote the response 
of the jth subject in the ith group and the kth period {i — 1,2; j — 1,...,^^; 
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k = 1,2). The model is 

Yijk ^ IJ, + ^ij + TTk + (l)e + Xe + Eijk 

where /i is the overall population mean, is the effect of the jth patient in the ith 
group, TTfc is the effect of the kth period, 0^ is the effect of the ^th treatment, Xe is 
the residual effect of the ith treatment and Eijk is the random error. We assume that 
the and Eijk are independent and that the are identically N{0, a^) distributed 
and the E^jk are identically A^(0, af) distributed, where > and > 0. Let 
m = {l/ni) + (1/77,2), cr"^ = <J^ + (Jg and p = (Js/{cr^ + erf). The parameter of 
interest is 9 — (pi — (f)2- The parameter describing the differential carryover effect is 
ip — {Xi — X2)/2. We suppose that we have uncertain prior information that = 0. 

We use the notation yj.fe = (l/jT-i) Yl^Li^ijk {i — 1,2). Our statistical analysis 
will be described entirely in terms of the following random variables: A = (Y^.i — 

F1.2 - F2.1 + F2.2)/2, ^ = (Fi.i + F1.2 - F2.I - F2.2)/2, 
1 / "1 "2 

\j=i j=i 
These random variables are independent and they have the following distributions: 
Ar.N{e- ^,mal/2), i> ~ 7V(V;,m(<72 + 2al)/2), V/al ~ x^,+n.-2 and W/{al + 
2(7's) ~ Xni+n2-2- Define © = A + * = Yi.i — Yz-i- This estimator of 9 is based 
solely on the data from period 1. Consequently, it is not infiuenced by any carryover 
effects. Note that 



N 



m mp^ 
mp^ mp^ 



(1) 



where p denotes the correlation between © and ^ and is equal to ^/(l + p)/2. 

We follow Freeman (1989) and assume that the subject variance al and the 
error variance are known. This implies that the parameters and p arc known. 
Using the random variables V and W in the obvious way, al and cr^ can be estimated 
consistently as rii + 712 — > 00. In other words, we are using a large-sample analysis. 

We use the notation [a ± h\ for the interval [a — 6, a + 6] (6 > 0). Define Cq, = 
$~^(1 — f), where $ denotes the A'"(0, 1) cumulative distribution function. The 



usual 1 — a confidence interval for 6, based solely on data from the first period, is 
/ = [O ± CQ-y/mcr] . Define the following confidence interval for 9: 

J(b, s) = 6 — V^cr b I =r ± \/mcr S I =r , 

\ay/mpj \ay/mpj_ 

where the functions b and s are required to satisfy the following restriction. 

Restriction 1 6 : M ^ M is an odd function and s : [0, oo) [0, oo). 

Invariance arguments, of the type used by Farchione and Kabaila (2008), may be 

used to motivate this restriction. For the sake of brevity, these arguments are 

omitted. We also require the functions b and s to satisfy the following restriction. 

Restriction 2 b and s are continuous functions. 

This implies that the endpoints of the confidence interval J{b, s) are continuous 
functions of the data. Finally, we require the confidence interval J{b, s) to coincide 
with the standard 1 — a confidence interval I when the data strongly contradict the 
prior information. The statistic \4/\/{a^/Tnp) provides some indication of how far 
away il)/{a^/mp) is from 0. We therefore require that the functions b and s satisfy 
the following restriction. 

Restriction 3 b{x) = for all |x| > d and s{x) = for all x > d, where d is a 
(sufficiently large) specified positive number. 

Define 7 = i)/{a^p), G = {Q - e)/{a^) and H = i!/{a^p). It follows 
from ([H) that 

(2) 
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It is straightforward to show that the coverage probability P(6 E J{b,s)) is equal 

to P{£iH) <G< u{H)) where i{h) = b{h) - s{\h\) and u{h) = b{h) + s{\h\). For 

given 6, s and p, this coverage probability is a function of 7. We denote this coverage 

probability by 0(7; 6, s, p). 

Part of our evaluation of the confidence interval J(6, s) consists of comparing it 

with the standard 1 — a confidence interval I using the criterion 

expected length of J(6, s) 
length of / 

We call this the scaled expected length of J{b,s). This is equal to E{s{\H\))/ca- 
This is a function of 7 for given s. We denote this function by 6(7; s). Clearly, for 
given s, 6(7; s) is an even function of 7. 
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Our aim is to find functions b and s tliat satisfy Restrictions 1-3 and such that 
(a) the minimum of 0(7; b, s, p) over 7 is 1 — a and (b) 



(e(7; s) - 1) di^i^) (4) 

is minimized, where the weight function u has been chosen to be 

z/(a:) = ux + Tt{x) for all x G M, 

where is a specified nonnegative number and Ti. is the unit step function defined by 
T-C{x) = 1 for X > and 7^(x) = for x < 0. The larger the value of u, the smaller 
the relative weight given to minimizing 6(7; s) for 7 = 0, as opposed to minimizing 
e(7; s) for other values of 7. 

The following theorem (cf. Kabaila and Giri, 2007a) provides computationally 
convenient expressions for the coverage probability and scaled expected length of 
J{b,s). 

Theorem 2.1 

(a) DeGne the functions k\h, 7, p) = A (— Cq, c^; p{h — 7), 1 — p^) and k{h, 7, p) = 
A{i{h),u{h); p{h - 7), 1 - p^), where A{x,y; p,v) = P{x < Z < y) for Z N{p,v). 
The coverage probabiUty P[9 E J{b, s)) is equal to 

r-d 

(1 - a) + / {k{h, 7, p) - k\h, 7, p)) (l){h - 7) dh, (5) 

J-d 

where (p denotes the N{0, 1) probabihty density function. For given b, s and p, 
0(7; 6, s, p) is an even function of 7. 

(b) The scaled expected length of J{b, s) is 



1 



Cn 



d 



e{r, s) = l + - {s{\h\)-c^) <j){h - 7) dh. (6) 



d 



Substituting ([6]) into (jl]) we obtain that (jl]) is equal to 

POO pd 2 ' 



is{\h\)-Ca)(j)ih--f)dhdiy{-f) = — - c«) (cj + rf/i. (7) 

Co Jo 

For computational feasibility, we specify the following parametric forms for the 
functions b and s. We require 6 to be a continuous function and so it is necessary that 
6(0) = 0. Suppose that xi, . . . ,Xq satisfy = xi < X2 < ■ ■ ■ < Xq = d. Obviously, 



b{xi) = 0, b{xq) = and s{xq) = Ca- The function b is fully specified by the vector 
(b{x2), ■ ■ ■ ,b{xq-i)) as follows. Because b is assumed to be an odd function, we 
know that b{—Xi) = —b{xi) for i = 2, . . . , g. We specify the value of b{x) for any 
X G [— d, d\ by cubic spline interpolation for these given function values, subject to 
the constraint that b'{—d) = and b'{d) = 0. We fully specify the function s by the 
vector [s{xi), . . . , s{xq-i)) as follows. The value of s{x) for any x E [0, d] is specified 
by cubic spline interpolation for these given function values (without any endpoint 
conditions on the first derivative of s). We call Xi,X2, ■ ■ - Xq the knots. 

To conclude this section, the new 1 — a confidence interval for 6 that utilizes the 
prior information that = is obtained as follows. For a judiciously-chosen set of 
values of d, u and knots Xi, we carry out the following computational procedure. 

Computational Procedure Compute the functions b and s, satisfying Restrictions 
1-3 and taking the parametric forms described above, such that (a) the minimum 
over 7>0of (jnDisl — a and (b) the criterion ([7]) is minimized. Plot 6^(7; s), the 
square of the scaled expected length, as a function of 7 > 0. 

Based on these plots and the strength of our prior information that = 0, we choose 
appropriate values of d, uj and knots Xj. The confidence interval corresponding to 
this choice is the new 1 — a confidence interval for 9. For given cj, the functions 
b and s can be chosen to be functions of p, since p is assumed to be known. All 
the computations for the present paper were performed with programs written in 
MATLAB, using the Optimization and Statistics toolboxes. 

3. Illustration of the properties of the new confidence interval 

The parameter p lies in the interval (l/y^, l). To illustrate the properties of 
the new 1 — a confidence interval for 9, consider the case that (rl/al = 1, so that 
p = V3/2. Suppose that 1 — a = 0.95. We have followed the Computational 
Procedure, described in the previous section, with d = 6, u = 0.2 and evenly-spaced 
knots at 0, 6/8, . . . , 6. The resulting functions b and s, which specify the new 0.95 
confidence interval for 9, are plotted in Figure 1. The performance of this confidence 
intervals is shown in Figure 2. When the prior information is correct (i.e. 7 = 0), 
we gain since e^(0; s) = 0.8527. The maximum value of 6^(7; s) is 1.1239. This 
confidence interval coincides with the standard 1 — a confidence interval for 9 when 
the data strongly contradicts the prior information, so that 6^(7; s) approaches 1 as 
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7 — ^ C)0. 

The value of u = 0.2 was obtained from the following search. Consider u = 0.05, 
0.2 , 0.5 and 1. The Computational Procedure was applied for each of these values. 
As expected from the form of the weight function, for each of these values of cu, 
6^(7; s) is minimized at 7 = 0. For a given value of A, define the 'expected gain' 
to be (1 — e^(0;s)) and the 'maximum potential loss' to be (max^e^(7;s) — l). 
As shown in Table 1, as a; increases (a) the expected gain decreases and (b) the 
ratio (expected gain) / (maximum potential loss) increases. By choosing cu — 0.2 we 
have both a reasonably large expected gain and a reasonably large value of the ratio 
(expected gain)/(maximum potential loss). 



CO 


0.05 


0.2 


0.5 


1 


expected gain 


0.2173 


0.1473 


0.0904 


0.0542 


maximum potential loss 


0.2982 


0.1239 


0.0595 


0.0324 


(expected gain) / (maximum potential loss) 


0.7288 


1.1892 


1.5206 


1.6704 



Table 1: Performance of the 0.95 confidence interval for d — 6 and evenly-spaced 
knots at 0, 6/8, ... , 6, when we vary over u; e {0.05, 0.2, 0.5, 1}. 



4. Compcirison of the two-period crossover trial with a completely 
randomized design with the same number of measurements of response 

For the two-period crossover trial, the total number of measurements of response 
is 2M, where M = rii + 77,2. Following Brown (1980), we compare this design 
with a completely randomized design with the same total number of measurements 
of the response. For the completely randomized design, we have M randomly- 
chosen subjects given treatment A and M randomly-chosen subjects given treatment 
B. Let Y^, . . . , denote the responses for the M subjects given treatment A 
and let Yf , . . . ,Y^ denote the responses for the M subjects given treatment B. 
A model for these responses that is consistent with the model used for the two- 
period crossover trial is the following. Suppose that Y/^, . . . , Y^, Yf, . . . , Y^ are 
independent random variables, with Y^,...,Y^j identically A^(0i,a^) distributed 
and Y^ ^ . . . , Y^ identically N{(f)2, o"^) distributed. The usual estimator of = 0i— 02 
is e = ((Fi^ + --- + Y^)-{Yf + ■■■ + Y^)) /M. Obviously 9 - N{e, 2a'^/M). 
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Figure 1: Plots of the functions b and s for af/a^ = 1 and 1 — a = 0.95. These func- 
tions were obtained using d = 6, u = 0.2 and evenly-spaced knots Xi at 0, 6/8, ... , 6. 
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Figure 2: Plots of the coverage probability and 6^(7; s), the squared scaled expected 
length, ^as functions of 7 = il'/ \j var(^) j of the new 0.95 confidence interval for 

6 when crl/al = 1. These functions were obtained using d = 6, u = 0.2 and 
evenly-spaced knots Xi at 0, 6/8, . . . , 6. 
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Now, following Brown (1980), consider the case that there is no differential car- 
ryover effect i.e. that ip = 0. In this case, 6 is estimated by A ~ N{9,'ma'^/2). 
Thus 

var(y4) rii + n2 f I 1 \ 1 
var(e) ~ 4 Ui ^ 1 + (cr.Vc^e) ' 
As this expression shows, the efficiency of the two-period crossover trial, relative to 

the completely randomized design, is an increasing function of cr^/o"^. For the case 
Hi = n2 = n, the two-period crossover trial is more efficient than the completely 
randomized design for all crl/al > 0. In other words, if we are absolutely certain 
that there is no differential carryover effect then we should always use the two-period 
crossover trial, as opposed to the completely randomized design. 

However, as noted in the introduction, it is commonly the case that it is not 
certain that there is no differential carryover effect. We ask the following question. 
What is the efficiency of the two-period crossover trial relative to the completely 
randomized design in this case? We consider this question in the context that o"^ 
and 0"^ are known. In other words, we consider this question in the context of large 
samples. We also assume that the new 1 — a confidence interval described in Section 
2 is used to summarise the data from the two-period crossover trial. For simplicity 
suppose that rii = n2 = n. Based on data from a completely randomized design, 
that usual 1 — a confidence interval for ^ is = [G ± CqO"/ ^/n\ . In earlier sections, 
we have assessed the new 1 — a confidence interval using the scaled expected length 
criterion ([3]), denoted by e(7;s). To compare the two-period crossover trial with 
a completely randomized design with the same total number of measurements, we 
now use the criterion 

^ _ ^ expected length of J{b, s) 
' length of K 

Note that r(7; s) = -\/2e(7;s), so that r^(7; s) = 2 6^(7; s). For a given value of 

p G (l/-\/2, 1), let us restrict attention to the class C(p, 1 — a) of new 1 — a confidence 

intervals that satisfy the constraint max^e^(7; s) < 1.25, so that 

maxr^(7; s) < 1.5. (8) 
7 

This condition puts an upper bound on how badly the new 1 — a confidence interval 
can perform relative to the confidence interval K based on data from the completely 
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randomized design. Consider the particular case that 1 — a = 0.95. For each 
p E (1/v^, 0.98], i.e. for each <j1/al E (0,11.6263], we find computationally that 
min^r^(7; s) > 1 for every new 1 — a confidence interval belonging to C(p, 0.95). 
In other words, if we impose the reasonable constraint (IH]) then, for 1 — a = 0.95 
and large samples, the completely randomized design is better than the two-period 
crossover trial for each crl/al E (0, 11.6263]. This is a complete contrast to the case 
that we are absolutely certain that there is no differential carryover effect. 

5. Implications for finite samples 

By replacing the parameters a and p by their obvious estimators (based on the 
statistics V and W) in the new large-sample 1 — a confidence interval described in 
Section 2, we obtain a new finite-sample confidence interval for 9. This new finite- 
sample confidence interval will have coverage and scaled expected length properties 
that will approach the corresponding properties for the new large-sample 1 — a 
confidence interval as ni + ^2 ^ oo. This suggests that it will be possible to design 
confidence intervals for 9 that utilize the uncertain prior information that there 
is no differential carryover effect for small and medium, as well as large sample 
sizes. This also suggests that the result found in Section 4 will also be reflected 
in small and medium, as well as large samples sizes. We expect that subject to a 
reasonable upper bound on how badly any new flnite-sample 0.95 confldence interval 
can perform relative to the usual 0.95 confldence interval for 9 based on data from the 
completely randomized design, the completely randomized design is better than the 
two-period crossover trial for a very wide range of values of (subject variance) / (error 
variance) . 

Appendix. Proof of Theorem 2.1 

In this appendix we prove Theorem 2.1. 

Proof of part (a). It follows from ([2]) that the probability density function of if, 
evaluated at is (pih — 7). Thus 

/oo ru{h) 
I fG\H{g\h)dg(j){h-j)dh (A.l) 
-00 Je(h) 
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where fG\H{g\h) denotes the probabihty density function of G conditional on H = 
h, evaluated at g. The probability distribution of G conditional on H = h is 



The standard 1 — a confidence interval / has coverage probability 1 — a. Hence 



The result follows from subtracting (1A.3|) from (]A.2p and noting that b{x) = for 
all I a; I > d and s{x) = Ca for all x > d. By a consideration of the distribution of 
(— G, —H), it may be shown that 0(7; b, s, p) is an even function of 7, for given 6, s 
and p. 

Proof of part (b). The result is an immediate consequence of the fact that h{x) = 
for all |a;| > d and s{x) = Ca for all x > d. 
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